Picture for Anssi Kanervisto

Anssi Kanervisto

BFM-Zero: A Promptable Behavioral Foundation Model for Humanoid Control Using Unsupervised Reinforcement Learning

Add code
Nov 06, 2025
Viaarxiv icon

Zero-Shot Whole-Body Humanoid Control via Behavioral Foundation Models

Add code
Apr 15, 2025
Viaarxiv icon

Fast Adaptation with Behavioral Foundation Models

Add code
Apr 10, 2025
Viaarxiv icon

Diffusion for World Modeling: Visual Details Matter in Atari

Add code
May 20, 2024
Figure 1 for Diffusion for World Modeling: Visual Details Matter in Atari
Figure 2 for Diffusion for World Modeling: Visual Details Matter in Atari
Figure 3 for Diffusion for World Modeling: Visual Details Matter in Atari
Figure 4 for Diffusion for World Modeling: Visual Details Matter in Atari
Viaarxiv icon

Toward Human-AI Alignment in Large-Scale Multi-Player Games

Add code
Feb 05, 2024
Figure 1 for Toward Human-AI Alignment in Large-Scale Multi-Player Games
Figure 2 for Toward Human-AI Alignment in Large-Scale Multi-Player Games
Figure 3 for Toward Human-AI Alignment in Large-Scale Multi-Player Games
Figure 4 for Toward Human-AI Alignment in Large-Scale Multi-Player Games
Viaarxiv icon

BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks

Add code
Dec 05, 2023
Figure 1 for BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Figure 2 for BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Figure 3 for BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Figure 4 for BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Viaarxiv icon

Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games

Add code
Dec 04, 2023
Viaarxiv icon

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition

Add code
Mar 23, 2023
Figure 1 for Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Figure 2 for Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Figure 3 for Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Figure 4 for Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Viaarxiv icon

Imitating Human Behaviour with Diffusion Models

Add code
Jan 25, 2023
Figure 1 for Imitating Human Behaviour with Diffusion Models
Figure 2 for Imitating Human Behaviour with Diffusion Models
Figure 3 for Imitating Human Behaviour with Diffusion Models
Figure 4 for Imitating Human Behaviour with Diffusion Models
Viaarxiv icon

A2C is a special case of PPO

Add code
May 18, 2022
Figure 1 for A2C is a special case of PPO
Viaarxiv icon